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DETAILED ACTION 
Claim Objections 

1 . Claims 9 and 21 objected to because of the following informalities: The examiner has 
interpreted "associated containing" as —containing—. Appropriate correction is required. 

Claim Rejections - 35 USC § 112 

2. The following is a quotation of the second paragraph of 35 U.S.C. 112: 

The specification shall conclude with one or more claims particularly pointing out and distinctly claiming the 
subject matter which the applicant regards as his invention. 

3. Claims 9 and 21 are rejected under 35 U.S.C. 112, second paragraph, as being indefinite 
for failing to particularly point out and distinctly claim the subject matter which applicant 
regards as the invention. The claims recite "automatically displaying" in the first line of the 
claims but it is unclear what is meant by "automatically." The examiner has interpreted 
"automatically displaying" to mean -displaying—. 

4. Claims K 9, 10, 12, 13, 2K 22 and 25 . recite limitation(s) for which there is insufficient 
antecedent basis for the limitation(s) in the claim(s), as follows: 

Claims 9 and 21 recite the limitation "said automatically displaying" in the first line of 
the claims. The examiner has interpreted "method of claim 5" to be —method of claim 2—; and 
"machine-readable storage of claim 18" to be —machine-readable storage of claim 14—. 

Claims 1,13 and 25 recite the limitation "the user" in lines 6, 8, and 6, respectively. The 
examiner has interpreted "the user" to be -a user-. 



Application/Control Number: 10/688,041 Page 3 

Art Unit: 2626 

Claim 12 recites the limitation "the text input." The examiner has interpreted "method of 
claim 10" to be -method of claim 1 1—. 

Claims 10 and 22 recite the limitation "the segment data set." The examiner has 
interpreted "method of claim 9" to be -method of claim 6-; and "machine-readable storage of 
claim 21" to be -machine-readable storage of claim 18~. 

Appropriate correction is required. 

Claim Rejections - 35 USC § 101 

5. 35 U.S.C. 101 reads as follows: 

Whoever invents or discovers any new and useful process, machine, manufacture, or composition of matter, or 
any new and useful improvement thereof, may obtain a patent therefor, subject to the conditions and 
requirements of this title. 

Claims 1, 13 and 25 are rejected under 35 U.S.C. 101 because the claimed invention is 
directed to non-statutory subject matter. Claims 1,13 and 25 recite a mathematical algorithm 
that does not provide a practical application that produces a useful, tangible and concrete result , 
(see "Interim Guidelines for Examination of Patent Applications for Patent Subject Matter 
Eligibility"), 

Claims 2-12 and 14-24 are also rejected for failing to cure the deficiencies of the above 
rejected nonstatutory claims 1,13 and 25. 

Claim Rejections - 35 USC §102 

6. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that form the 
basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 
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(b) the invention was patented or described in a printed publication in this or a foreign country or in pubhc use or on 
sale in this country, more than one year prior to the date of application for patent in the United States. 

7. Claims 1-3, 6, 9 13-15, 18-21 and 25 are rejected under 35 U.S.C. 102(b) as being 
anticipated by Yamazaki , US Patent 5,864,814 (published Jan. 26, 1999). 

Regarding claims K 13 and 25 , Yamazaki teaches a method for debugging and tuning 
synthesized audio (col. 22, line 28 - col. 26, line 28; Figs. 26-33), comprising the steps of: 

displaying a waveform corresponding to synthesized audio generated from concatenated 
phonetic units; and displaying original recording containing selected phonetic unit; (col. 23, 11. 
17-24, teaches "original waveform display window... synthesized waveform display window"); 

displaying parameters corresponding to at least one of the phonetic units (col. 23, 11. 41- 
49, "correlation between the parameters on the time axis"; col. 23, line 66 - col. 24, line 43, 
"pitch pattern Wl . . .pitch pattern W2 of the synthesized waveform. . .pitch label"; col. 25, 11. 1 7- 
32, "velocity indicating a volume; col. 23, 11. 41-49, "correlation between the parameters on the 
time axis"; col. 25, U. 61-67, "change of velocity... change of pitch... manual operation"; col. 26, 
11. 23-28, "parameter"; Fig. 33 illustrates the pitch/velocity for a certain phoneme, which 
provides a visual indication of the pitch/velocity values (displaying parameters)); 

receiving an editing input from the user (col, 25, line 61 - col, 26, line 16, "change of 
velocity... change of pitch... manual operation"); and 

adjusting the parameters in accordance with the editing input (col. 25, line 61 - col. 26, 
line 16, "value of pitch is changed"). 



Regarding claims 2 and 14 . Yamazaki teaches automatically displaying the parameters 
responsive to a user selection of at least a portion of the waveform, the displayed parameters 
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correlating to the selected portion of the waveform (col. 26, 11. 29-47, "editing. . .basically the 
same processing. . .file as an object for editing is selected. . .treated as an original waveform," if a 
user selects a file then he/she selects the entire waveform, at least a portion of the waveform, 
which is put on the display with the pitch, velocity, etc.). 

Regarding claims 3 and 15 . Yamazaki teaches identifying a portion of the waveform 
responsive to a user selection of at least one of the parameters, the identified portion of the 
waveform correlating to the selected parameters (col. 25, 11. 18-39, "velocity 
adjustment. . .velocity El in a time zone for the phoneme of 'ka' . . .subdivided to velocity Ell"; 
col. 25, 11. 61-67, "value of pitch. . .for each label"; Figs. 3 1-33; e.g. choosing the velocity for one 
of the phonemes to be changed identifies the corresponding phoneme/portion so that that portion 
of the waveform can be changed). 

Regarding claims 6 and 18, Yamazaki teaches wherein the edited parameters are 
contained in a segment dataset (Fig. 33; col. 25, 11. 40-52, "new filing"; col. 25, line 61 - col. 26, 
line 28, "change of velocity... change of pitch... for each label"' col. 26, 11. 29-53, "change of 
parameters in... original synthesized waveform... object for editing"; col. 23, 11. 1-16, "making 
new voice-generating information"; the information is a set of data for the voice segment 
(segment dataset), and is used to display the information in Fig. 33, and edits the information and 
the corresponding display). 



Regarding claim 1 9, Yamazaki teaches wherein the parameters comprise at least one 
parameter selected from the group consisting of a phonetic unit label, a phonetic unit boundary, a 
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pitch mark and a phonetic alignment (col. 23, 11. 50-62, "phoneme is set in each of spaces 
separated from each other with a label.. /yo'../de'"; col. 35, 11. 21-43, "pitch label"; Fig. 33; the 
pitch label provides an indication of pitch on the display (pitch mark); phoneme labels are 
aligned with the waveform sections (phonetic alignment)). 

Regarding claim 20 , Yamazaki teaches at least one action selected from the group 
consisting of deleting a pitch mark, inserting a pitch mark, repositioning a pitch mark and 
adjusting a phonetic alignment (col. 24, 11. 6-43, "deletion of a pitch label. . .adding a new 
label. . .movement"). 

Regarding claims 9 and 2K Yamazaki teaches [displaying] a recording's waveform 
[containing] the phonetic unit (col. 23, 11. 8-16, "natural voice is inputted... original waveform is 
displayed"; Fig. 33). 

Claim Rejections - 35 USC § 103 

8. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 

such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 



9. Claims 4. 10-12. 16 and 22-24 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Yamazaki in view of Mivatake et al 
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Regarding claims 4 and 16. Yamazaki teaches wherein the edited parameters are 
contained in a configuration file (Fig. 33; col. 25, 11.40-52, "new filing"; col. 25, line 61 - col. 
26, line 28, "change of velocity... change of pitch... for each label"; col. 26, 11. 29-53, "change of 
parameters in... original synthesized waveform... object for editing"; col. 23, 11. 1-16, "making 
new voice-generating information"). 

Yamazaki fails to teach wherein the edited parameters are contained in a text-to-speech 
engine configuration file. However, Miyatake teaches wherein the edited parameters are 
contained in a text-to-speech engine configuration file (col. 1, 11. 55-62, "receives text data and 
edition data... synthesizes speech corresponding to the character data"; col. 4, 11. 20-44, "pauses 
are created"; Figs. 3-4; Miyatake teaches a visual speech synthesis editing system, like 
Yamazaki, and where it is used in a text-to-speech environment). 

Therefore, it would have been obvious to modify the teaching elements of Yamazaki with 
Miyatake in order to use editing data to produce a natural sounding spoken output fi-om text, as 
described by Miyatake (col. 1, 11. 55-62 and 11. 20-30). 

Regarding claims 10 and 22 . Yamazaki fails to explicitly teach wherein edits to the 
waveform adjust parameters in the segment dataset. However, Miyatake teaches this feature 
(Figs. 3-4, col. 4, 11. 20-36, "displayed characters are edited by the inputting means... inputting 
means. . .separate. . .from each other. . .pauses are created between"; inserting the pauses edits the 
waveform and adjusts the starting and ending positions [parameters] of the phonemes). 

Therefore, it would have been obvious for one of ordinary skill in the art at the time the 
invention was made to modify the teaching elements of Yamazaki with Miyatake in order to 
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synthesize a natural speech that is close to the way a human being speaks, as taught by Miyatake 
(col. 1,11.20-30). 

Regarding claims 1 1 and 23. Yamazaki fails to explicitly teach wherein the synthesized 
audio is generated from a text input. However, Miyatake teaches wherein the synthesized audio 
is generated from a text input (col. 1, 11. 55-62, "receives text data and edition data... synthesizes 
speech corresponding to the character data"; col. 4, 11. 20-44, "pauses are created"; Figs. 3-4; 
Miyatake teaches a visual speech synthesis editing system, like Yamazaki, and where it is used 
in a text-to-speech environment). 

Therefore, it would have been obvious for one ordinary skill in the art at the time the 
invention was made to modify the teaching elements of Yamazaki with Miyatake in order to 
produce a natural sounding spoken output from text, as described by Miyatake (col. 1, 11. 55-62 
and 11. 20-30). 

Regarding claims 12 and 24 . Yamazaki fails to explicitly teach wherein the text input is 
received from the user. However, Miyatake teaches wherein the text input is received from the 
user (col. 3, 11. 5-13, "inputting means... keyboard"). 

Therefore, it would have been obvious for one of ordinary skill in the art at the time the 
invention was made to modify the teaching elements of Yamazaki with Miyatake in order to use 
editing data to produce a natural sounding spoken output from text, as described by Miyatake 
(col. 1,11. 55-62 and 11.20-30). 
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10. Claims 5, 7, 8 and 17 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Yamazaki in view Mivatake et ai . and further in view of Campbell et al ("Campbell"), US 
Patent 6,366,883. 

Regarding claims 5 and 17 , Yamazaki teaches the edited parameters comprising at least 
one property selected from the group consisting of speed, base pitch, and volume (col. 25, 11. 61- 
67, "change of velocity... change of pitch"; coL 25, 11. 18-23, "velocity indicating a 
volume... manually adjusted"). 

Yamazaki does not teach comprising search cost function weights. However, Campbell 
teaches search cost function weights (see abstract, "target cost" and col. 15, 11. 28-36, "weighting 
coefficient"). It would have been obvious for one of ordinary skill in the art at the time the 
invention was made to modify the teaching elements of Yamazaki and Miyatake with Campbell 
because Campbell teaches an advantage is that the speech segments of the speech waveform 
signals in the speech waveform database can be directly utilized (col. 15, 11. 33-36). 

Regarding claim 1 . Yamazaki teaches wherein the parameters comprise at least one 
parameter selected from the group consisting of a phonetic unit label, a phonetic unit boundary, a 
pitch mark and a phonetic alignment (col. 23, 11. 50-62, "phoneme is set in each of spaces 
separated from each other with a label... 'yo'...'de'"; col. 35, 11. 21-43, "pitch label"; Fig. 33; the 
pitch label provides an indication of pitch on the display (pitch mark); phoneme labels are 
aligned with the waveform sections (phonetic alignment)). 

Regarding claim 8 , Yamazaki teaches at least one action selected from the group 
consisting of deleting a pitch mark, inserting a pitch mark, repositioning a pitch mark and 
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adjusting a phonetic alignment (col. 24, 11. 6-43, "deletion of a pitch label... adding a new 
label. . .movement"). 

Conclusion 

1 1 . The prior art made of record and not relied upon is considered pertinent to applicant's 
disclosure: 

Yamazaki (US 5,875,427) teaches voice-generating/document making apparatus voice- 
generating/document making method and computer-readable medium for storing therein a 
program having a computer execute voice-generating/document making sequence. 

12. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Eunice Ng whose telephone number is 571-272-2854. The 
examiner can normally be reached on Monday through Friday, 8:30 a.m. - 5:00 p.m. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, David Hudspeth can be reached on 571-272-7843. The fax phone number for the 
organization where this application or proceeding is assigned is 571-273-8300. 
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Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). If you would 
like assistance from a USPTO Customer Service Representative or access to the automated 
information system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 
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